A corpus based approach to find similar keywords for search engine marketing

Authors

  • Muazzam Siddiqui
  • Mohammad Fayoumi
  • Nidal Yousef
Abstract

Automatic thesaurus generation is used by search engines for query expansion. Search engine marketing companies use the same concept to suggest keyword terms to their clients, with the aim of improving the client’s ratings on different search engines. This paper presents and evaluates a corpus-based method for finding similar terms. The corpus is generated by scraping websites in different categories. A feature selection method is developed that rewards category-specific terms and penalizes terms shared by two or more categories. The similarity measure is decomposed into three distinct components: contextual, functional, and lexical similarity. Contextual similarity finds terms that appear in the same context, functional similarity finds terms on a co-occurrence basis, and lexically similar terms share one or more words. An overall similarity measure combines the evidence from these three measures.
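The three-part decomposition described in the abstract can be illustrated with a minimal Python sketch. The abstract does not give the paper's formulas, so everything below is an assumption chosen for illustration: Jaccard word overlap for lexical similarity, document-level co-occurrence for functional similarity, cosine similarity of neighbouring-term counts for contextual similarity, and an equal-weight average as the combination. All function names and the `window` and `weights` parameters are hypothetical.

```python
from collections import Counter
from math import sqrt


def lexical_similarity(a, b):
    """Assumed measure: Jaccard overlap of the words in two keyword terms."""
    wa, wb = set(a.lower().split()), set(b.lower().split())
    union = wa | wb
    return len(wa & wb) / len(union) if union else 0.0


def functional_similarity(a, b, docs):
    """Assumed measure: share of documents containing either term that contain both."""
    has_a = {i for i, doc in enumerate(docs) if a in doc}
    has_b = {i for i, doc in enumerate(docs) if b in doc}
    union = has_a | has_b
    return len(has_a & has_b) / len(union) if union else 0.0


def context_vector(term, docs, window=2):
    """Count the terms that appear within `window` positions of `term`."""
    ctx = Counter()
    for doc in docs:
        for i, t in enumerate(doc):
            if t == term:
                lo = max(0, i - window)
                ctx.update(doc[lo:i] + doc[i + 1:i + 1 + window])
    return ctx


def cosine(u, v):
    """Cosine similarity of two sparse count vectors."""
    dot = sum(u[k] * v[k] for k in u.keys() & v.keys())
    norm = sqrt(sum(x * x for x in u.values())) * sqrt(sum(x * x for x in v.values()))
    return dot / norm if norm else 0.0


def contextual_similarity(a, b, docs, window=2):
    """Assumed measure: cosine similarity of the two terms' context vectors."""
    return cosine(context_vector(a, docs, window), context_vector(b, docs, window))


def overall_similarity(a, b, docs, weights=(1 / 3, 1 / 3, 1 / 3)):
    """Assumed combination: weighted average of the three components."""
    parts = (
        contextual_similarity(a, b, docs),
        functional_similarity(a, b, docs),
        lexical_similarity(a, b),
    )
    return sum(w * p for w, p in zip(weights, parts))


# Toy corpus: each document is a list of already-extracted terms.
docs = [
    ["cheap flights", "to", "paris"],
    ["cheap tickets", "to", "paris"],
    ["hotel", "in", "rome"],
]
score = overall_similarity("cheap flights", "cheap tickets", docs)
```

In this toy corpus, "cheap flights" and "cheap tickets" never occur in the same document, so their functional score is zero, yet they share identical neighbours and one word, so the contextual and lexical components still contribute. This is the kind of complementary evidence the combined measure is meant to capture.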

Similar resources

Advertising Keyword Suggestion Using Relevance-Based Language Models from Wikipedia Rich Articles

When emerging technologies such as Search Engine Marketing (SEM) face tasks that require human level intelligence, it is inevitable to use the knowledge repositories to endow the machine with the breadth of knowledge available to humans. Keyword suggestion for search engine advertising is an important problem for sponsored search and SEM that requires a goldmine repository of knowledge. A recen...

Keyword Suggestion for Search Engine Marketing

This document contains research in the field of Search Engine Marketing; specifically, it focuses on finding, implementing and evaluating a technique that can be used for keyword suggestion. Based on a set of words describing the subject, an algorithm finds and ranks keyword suggestions that can be used to advertise with. The main goal is to find keywords that are non-obvious, which means th...

Examining the Impact of Contextual Ambiguity on Search Advertising Keyword Performance: A Topic Model Approach

Sponsored search advertising offers a more targeted way of marketing than traditional advertising. However, the context of consumer search is often unobserved and the prediction of it can be nontrivial. Consumer search contexts may vary even when consumers are searching for the same keyword. Due to the ambiguity of a keyword, a large portion of the ads displayed may fall outside a particular co...

Optimizing question answering systems by Accelerated Particle Swarm Optimization (APSO)

One of the most important research areas in natural language processing is Question Answering Systems (QASs). Existing search engines, with Google at the top, have many remarkable capabilities. But there is a basic limitation (search engines do not have deduction capability), a capability which a QAS is expected to have. In this perspective, a search engine may be viewed as a semi-mechanized QA...

QUICK: Queries Using Inferred Concepts from Keywords

We present QUICK, an entity-based text search engine that blends keyword search with structured query processing over rich knowledge bases (KB) with massive schemas. We introduce a new formalism for structured queries based on keywords that combines the flexibility of keyword search and the expressiveness of structured queries. We propose a solution to the resulting disambiguation problem cause...

Journal title:
  • Int. Arab J. Inf. Technol.

Volume 10

Publication date: 2013